Hierarchical Skill Learning for High-Level Planning
نویسنده
چکیده
Many of the existing techniques for controlling goaldirected agent behavior use one of two primary approaches: heuristic-search planning (HSP) or reinforcement learning (RL). Each has its advantages and disadvantages. For instance, heuristic-search planning does not traditionally learn from previous experience, and can only be applied in domains for which a complete domain model exists. On the other hand, reinforcement learning often performs poorly in new situations until it has gained enough experience to learn an effective policy, and it is difficult to scale RL up to large, complex domains. Both RL and HSP tend to work poorly in domains that require long action sequences. Heuristic-search state-space planning is intractable in such domains, because of the very large search spaces, and reinforcement learning may require exponentially many execution traces to converge. For agents with only low-level primitive actions, such as moving limbs, this makes it intractable to solve problems in complex domains. In order to address both the unique and the shared problems of HSP and RL, I propose a new research direction called skill bootstrapping (SB). The goal of SB is to provide an integrated learning and planning architecture that can improve its performance over time in complex domains. An SB agent starts with a basic set of primitive actions (and their preconditions and effects) as its model of the world. Over the course of solving numerous problems by applying HSP to the primitive actions, SB identifies recurring subgoals, for which it uses RL to create skills that can be applied within the HSP process to solve these subgoals more efficiently. Subgoals can be set by a human supervisor, by a request from the environment, or through a process of exploration. The skills behave as partial policies that can be used reactively, without lengthy deliberative reasoning. Once a new skill is learned, it becomes available for use by the planner along with the other primitive actions, allowing for more compact plans. Additionally, just as future plans can use learned skills, future skills may be built upon lower-level skills. Over the course of the agent’s experience,
منابع مشابه
Learning by imitation: a hierarchical approach.
To explain social learning without invoking the cognitively complex concept of imitation, many learning mechanisms have been proposed. Borrowing an idea used routinely in cognitive psychology, we argue that most of these alternatives can be subsumed under a single process, priming, in which input increases the activation of stored internal representations. Imitation itself has generally been se...
متن کاملLearning Control of Complex Skills Committee in Charge: Learning Control of Complex Skills Abstract Learning Control of Complex Skills
This dissertation presents a hierarchical controller which can learn to perform complex motor skills. Humans routinely coordinate many degrees of freedom smoothly and eeortlessly to achieve complex goals. Moreover, we are good at learning new patterns of coordination to produce new skills. Robots and artiicial systems, on the other hand, typically have diiculty with the kinds of behaviors that ...
متن کاملMedical students points of view on clinical skill lab function in 8 month period
Introduction. Considering rights of patients and presence of a need for a standard clinical skills learning setting in recent 3 decades, it seems necessary to encourage decision – makers of medical education to establish clinical skills laboratory, in order to minimize unnecessary practices and possible injuries and reduce stress over students. CLS in TUMS was established in 2001. In planning...
متن کاملAutomata Guided Hierarchical Reinforcement Learning for Zero-shot Skill Composition
An obstacle that prevents the wide adoption of (deep) reinforcement learning (RL) in control systems is its need for a large amount of interactions with the environment in order to master a skill. The learned skill usually generalizes poorly across domains and re-training is often necessary when presented with a new task. We present a framework that combines methods in formal methods with hiera...
متن کاملToward the Autonomous Acquisition of Robot Skill
The design and coordination of independent specialized skill units (often called action primitives) is fundamental to modern robotics. However, a robot that must act in a complex environment over an extended period of time should do more than just use existing skills: it should learn new skills that increase its capabilities and facilitate later problem solving. Although robots exist that can l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010